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Abstract 

We study the problem of minimizing a nonnegative separable concave function over 
a compact feasible set. We approximate this problem to within a factor of 1 + e by a 
piecewise-linear minimization problem over the same feasible set. Our main result is 
that when the feasible set is a polyhedron, the number of resulting pieces is polyno- 
mial in the input size of the polyhedron and linear in 1/e. For many practical concave 
cost problems, the resulting piecewise-linear cost problem can be formulated as a well- 
studied discrete optimization problem. As a result, a variety of polynomial-time exact 
algorithms, approximation algorithms, and polynomial-time heuristics for discrete op- 
timization problems immediately yield fully polynomial-time approximation schemes, 
approximation algorithms, and polynomial-time heuristics for the corresponding con- 
cave cost problems. 

We illustrate our approach on two problems. For the concave cost multicommodity 
flow problem, we devise a new heuristic and study its performance using computational 
experiments. We are able to approximately solve significantly larger test instances than 
previously possible, and obtain solutions on average within 4.27% of optimality. For 
the concave cost facility location problem, we obtain a new 1.4991 + e approximation 
algorithm. 

1 Introduction 

Minimizing a nonnegative separable concave function over a polyhedron arises frequently in 
fields such as transportation, logistics, telecommunications, and supply chain management. 
In a typical application, the polyhedral feasible set arises due to network structure, capacity 
requirements, and other constraints, while the concave costs arise due to economies of scale, 
volume discounts, and other practical factors [see e.g. GP90]. The concave functions can 
be nonlinear, piecewise-linear with many pieces, or more generally given by an oracle. 
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A natural approach for solving such a problem is to approximate each concave function 
by a piecewise-linear function, and then reformulate the resulting problem as a discrete 
optimization problem. Often this transformation can be carried out in a way that preserves 
problem structure, making it possible to apply existing discrete optimization techniques to 
the resulting problem. A wide variety of techniques is available for these problems, including 
heuristics [e.g. BMW89, HH98], integer programming methods [e.g. AtaOl, OW03], and 
approximation algorithms [e.g. JMM + 03]. 

For this approach to be efficient, we need to be able to approximate the concave cost 
problem by a single piecewise-linear cost problem that meets two competing requirements. 
On one hand, the approximation should employ few pieces so that the resulting problem 
will have small input size. On the other hand, the approximation should be precise enough 
that by solving the resulting problem we would obtain an acceptable approximate solution 
to the original problem. 

With this purpose in mind, we introduce a method for approximating a concave cost 
problem by a piecewise-linear cost problem that provides a 1 + e approximation in terms 
of optimal cost, and yields a bound on the number of resulting pieces that is polynomial in 
the input size of the feasible polyhedron and linear in 1/e. Previously, no such polynomial 
bounds were known, even if we allow any dependence on 1 /e. 

Our bound implies that polynomial-time exact algorithms, approximation algorithms, 
and polynomial-time heuristics for many discrete optimization problems immediately yield 
fully polynomial-time approximation schemes, approximation algorithms, and polynomial- 
time heuristics for the corresponding concave cost problems. We illustrate this result by 
obtaining a new heuristic for the concave cost multicommodity flow problem, and a new 
approximation algorithm for the concave cost facility location problem. 

Under suitable technical assumptions, our method can be generalized to efficiently ap- 
proximate the objective function of a maximization or minimization problem over a general 
feasible set, as long as the objective is nonnegative, separable, and concave. In fact, our 
approach is not limited to optimization problems. It is potentially applicable for approx- 
imating problems in dynamic programming, algorithmic game theory, and other settings 
where new solutions methods become available when switching from concave to piecewise- 
linear functions. 

1.1 Previous Work 

Piecewise-linear approximations are used in a variety of contexts in science and engineering, 
and the literature on them is expansive. Here we limit ourselves to previous results on 
approximating a separable concave function in the context of an optimization problem. 

Geoffrion [Geo77] obtains several general results on approximating objective functions. 
One of the settings he considers is the minimization of a separable concave function over a 
general feasible set. He derives conditions under which a piecewise-linear approximation of 
the objective achieves the smallest possible absolute error for a given number of pieces. 

Thakur [Tha78] considers the maximization of a separable concave function over a con- 
vex set defined by separable constraints. He approximates both the objective and constraint 
functions, and bounds the absolute error when using a given number of pieces in terms of 
feasible set parameters, the maximum values of the first and second derivatives of the 
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functions, and certain dual optimal solutions. 

Rosen and Pardalos [RP86] consider the minimization of a quadratic concave function 
over a polyhedron. They reduce the problem to a separable one, and then approximate 
the resulting univariate concave functions. The authors derive a bound on the number of 
pieces needed to guarantee a given approximation error in terms of objective function and 
feasible polyhedron parameters. They use a nonstandard definition of approximation error, 
dividing by a scale factor that is at least the maximum minus the minimum of the concave 
function over the feasible polyhedron. Pardalos and Kovoor [PK90] specialize this result 
to the minimization of a quadratic concave function over one linear constraint subject to 
upper and lower bounds on the variables. 

Giider and Morris [GM94] study the maximization of a separable quadratic concave 
function over a polyhedron. They approximate the objective functions, and bound the 
number of pieces needed to guarantee a given absolute error in terms of function parameters 
and the lengths of the intervals on which the functions are approximated. 

Kontogiorgis [KonOO] also studies the maximization of a separable concave function over 
a polyhedron. He approximates the objective functions, and uses techniques from numerical 
analysis to bound the absolute error when using a given number of pieces in terms of the 
maximum values of the second derivatives of the functions and the lengths of the intervals 
on which the functions are approximated. 

Each of these prior results differs from ours in that, when the goal is to obtain a 1 + e 
approximation, they do not provide a bound on the number of pieces that is polynomial in 
the input size of the original problem, even if we allow any dependence on 1/e. 

Meyerson et al. [MMPOO] remark, in the context of the single-sink concave cost mul- 
ticommodity flow problem, that a "tight" approximation could be computed. Munagala 
[Mun03] states, in the same context, that an approximation of arbitrary precision could be 
obtained with a polynomial number of pieces. They do not mention specific bounds, or any 
details on how to do so. 

Hajiaghayi et al. [HMM03] and Mahdian et al. [MYZ06] consider the unit demand 
concave cost facility location problem, and employ an exact reduction by interpolating the 
concave functions at points 1,2,... ,m, where m is the number of customers. The size of 
the resulting problem is polynomial in the size of the original problem, but the approach is 
limited to problems with unit demand. 

1.2 Our Results 

In Section 2, we introduce our piecewise-linear approximation approach, on the basis of 
a minimization problem with a compact feasible set in R" and a nonnegative separable 
concave function that is nondecreasing. In this section, we assume that the problem has 
an optimal solution x* = (x*, . . . , x* n ) with x* £ {0} U [Zj, Ui] and < k < u; L . To obtain a 
1 + e approximation, we need only 1 + |~log 1+4e+4e 2 j 1 ] pieces for each concave function. As 
e — > 0, the number of pieces behaves as ^ log j 1 . 

In Section 2.1, we show that any piecewise-linear approach requires at least ^(^j log j 1 ) 
pieces to approximate a certain function to within 1 + e on [Zj,Uj]. Note that for any fixed 
e, the number of pieces required by our approach is within a constant factor of this lower 
bound. It is an interesting open question to find tighter upper and lower bounds on the 
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number of pieces as e — > 0. In Section 2.2, we extend our approximation approach to 
objective functions that are not monotone and feasible sets that are not contained in M™ . 

In Sections 3 and 3.1, we obtain the main result of this paper. When the feasible set 
is a polyhedron and the cost function is nonnegative separable concave, we can obtain a 

1 + e approximation with a number of pieces that is polynomial in the input size of the 
feasible polyhedron and linear in 1/e. We first obtain a result for polyhedra in M™ and 
nondecreasing cost functions in Section 3, and then derive the general result in Section 3.1. 

In Section 4, we show how our piecewise-linear approximation approach can be combined 
with algorithms for discrete optimization problems to obtain new algorithms for problems 
with concave costs. We use a well-known integer programming formulation that often 
enables us to write piecewise-linear problems as discrete optimization problems in a way 
that preserves problem structure. 

In Section 5, we illustrate our method on the concave cost multicommodity flow problem. 
We derive considerably smaller bounds on the number of required pieces than in the general 
case. Using the formulation from Section 4, the resulting discrete optimization problem 
can be written as a fixed charge multicommodity flow problem. This enables us to devise 
a new heuristic for concave cost multicommodity flow by combining our piecewise-linear 
approximation approach with a dual ascent method for fixed charge multicommodity flow 
due to Balakrishnan et al. [BMW89]. 

In Section 5.1, we report on computational experiments. The new heuristic is able to 
solve large-scale test problems to within 4.27% of optimality, on average. The concave cost 
problems have up to 80 nodes, 1,580 edges, 6,320 commodities, and 9,985,600 variables. 
These problems are, to the best of our knowledge, significantly larger than previously 
solved concave cost multicommodity flow problems, whether approximately or exactly. A 
brief review of the literature on concave cost flows can be found in Sections 5 and 5.1. 

In Section 6, we illustrate our method on the concave cost facility location problem. 
Combining a 1.4991-approximation algorithm for the classical facility location problem due 
to Byrka [Byr07] with our approach, we obtain a 1.4991 + e approximation algorithm for 
concave cost facility location. Previously, the lowest approximation ratio for this problem 
was that of a 3 + e approximation algorithm due to Mahdian and Pal [MP03]. In the 
second author's Ph.D. thesis [Str08], we obtain a number of other algorithms for concave 
cost problems, including a 1.61-approximation algorithm for concave cost facility location. 
Independently, Romeijn et al. [RSSZ10] developed 1.61 and 1.52-approximation algorithms 
for this problem. A brief review of the literature on concave cost facility location can be 
found in Section 6. 

2 General Feasible Sets 

We examine the general concave cost minimization problem 

Z* = min {4>{x) : x € X} , (1) 

defined by a compact feasible set X C M™ and a nondecreasing separable concave function 
4> : — > R + . Let x = (x\, . . . , x n ) and 4>{x) = Y17=i <M x j)i an d assume that the functions 
(f>i are nonnegative. The feasible set need not be convex or connected — for example, it could 
be the feasible set of an integer program. 
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In this section, we impose the following technical assumption. Let [n] = {1, . . . , n}. 

Assumption 1. The problem has an optimal solution x* = (x*, . . . , x* ) and bounds Zj and 
Ui with < k < Ui such that x* G {0} U [Zj, Uj] for i G [n]. 

Let e > 0. To approximate problem (1) to within a factor of 1 + e, we approximate 
each function fa by a piecewise- linear function ipi : R + — >■ R + . Each function ^ consists of 
Pi + 1 pieces, with Pj = |~log 1+e j 1 ] , and is defined by the coefficients 

a? = #(Zi(l + €)'), P6{0,...,P,}, (2a) 

/f = & (Z,(l + e)P ) - + e)P S ? , V € {0, . . . , P}. (2b) 

If the derivative <^ (Zj(l + e) p ) does not exist, we take the right derivative, that is s P = 
lim. ri _ 5> (z i (i +e )p)+ • ^ ne r isht derivative always exists at points in (0, +oo) 

since fa is concave on [0, +oo). We proceed in this way throughout the paper when the 
derivative does not exist. 

Each coefficient pair (sf,/f) defines a line with nonnegative slope s p and y-intercept 
ff, which is tangent to the graph of fa at the point Zj(l + e) p . For Xj > 0, the function 
is defined by the lower envelope of these lines: 

ipi(xi) = min{/f + s\xi : p = 0, . . . , P}. (3) 

We let ipi(0) = fa(0) and ip(x) = X^=i^*( x *)- Substituting for <p in problem (1), we 
obtain the piecewise-linear cost problem 

Z\ = min{^(x) : x G X}. (4) 

Next, we prove that this problem provides a 1 + e approximation for problem (1). The 
following proof has an intuitive geometric interpretation, but does not yield a tight analysis 
of the approximation guarantee. A tight analysis will follow. 

Lemma 1. Z\ < Z% < (1 + e)Z\. 

Proof. Let X — \^X *^ j • • • j X *) be an optimal solution to problem (4); an optimal solution 
exists since ip(x) is concave and X is compact. Fix an i 6 [n], and note that for each 
p G {0, . . . , Pi}, the line ff + s P Xj is tangent from above to the graph of fa(xi). Hence 
fa{x*) < min{/f + sfx* :p = 0,...,Pi} = ipi(x*). Therefore, Z* < </>(x*) < i/>(x*) = Z\. 

Conversely, let x* be an optimal solution of problem (1) that satisfies Assumption 1. It 
suffices to show that ipi(x*) < (1 + e)fa(x*) for i G [n]. If x* = 0, then the inequality holds. 
Otherwise, let p = [log 1+£ , and note that p G {0, ...,P} and ^ G [(1 + e) p , (1 + e) p+1 ). 
Because fa is concave, nonnegative, and nondecreasing, 

< ff + s p iX * < ff + 8^(1 + ef +1 

= ff + s%(l + e)(l + ef < (1 + e) {ff + sfk(l + ef) (5) 
= {l + e)fa Ml + eY)<{l + e)fa(x*). 

(See Figure 1 for an illustration.) Therefore, Z\ < ip(x*) < (1 + e)(f)(x*) = (1 + e)Z*. □ 



5 




Figure 1: Illustration of the proof of Lemma 1. Observe that the height of any point inside 
the box with the bold lower left and upper right corners exceeds the height of the box's 
lower left corner by at most a factor of e. 



We now present a tight analysis. 
Theorem 1. Z* < Z\ < 1+ ^ £+1 Z*. The approximation guarantee of i s tight. 

Proof. We have shown that Z* < Z\ in Lemma 1. Fix an % G [n], and consider the 
approximation ratio achieved on [li,Ui] when approximating fa by ipi. If faili) = 0, then 
fa{%i) = for all Xi > 0, and we have a trivial case. If fa(k) > 0, then fa(xi) > for all 
Xi > 0, and the ratio is 

min{l + 7 : ipi(xi) < (1 + j)fa(xi) for Xj G [/,, itj]} 

= max{V'i(xj)/^j(xj) : x, G (6) 

We derive an upper bound of on this ratio, and then construct a family of functions 

that, when taken as </>j, yield ratios converging to this upper bound. 

Without loss of generality, we assume Zj = 1 and Uj = (1 + e) Pi . The approximation 
ratio achieved on is the highest of the approximation ratios on each of the intervals 

[1, 1 + e], . . . , [(1 + e) Pi_1 , (1 + e) Pi }. By scaling along the x-axis, it is enough to consider 
only the interval [1, 1 + e], and therefore we can assume that ipi is given by the two tangents 
to the graph of <f>{ at 1 and 1 + e. Suppose these tangents have slopes a and c respectively. 
We can assume that (f>i(0) = 0, and that fa is linear with slope a on [0, 1] and linear with 
slope c on [1 + e, +oo). By scaling along the y-axis, we can assume that a = 1. 

We upper bound the approximation ratio between ipi and fa by the ratio between and 
a new function <pi that has <Pi(0) = and consists of 3 linear pieces with slopes 1 > b > c 
on [0, 1], [1, 1 + e], and [1 + e, +oo) respectively. The approximation ratio between ipi and 
ifi can be viewed as a function of b and c. Let 1 + £ be a point on the interval [1, 1 + e]. 
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Figure 2: Illustration of the proof of Theorem 1. 



We are interested in the following maximization problem with respect to b, c, and £ : 

max{^i(l + OM(l + : 1 > b > c > 0,0 < £ < e}. (7) 

Since ipi consists of 3 linear pieces, while ipi is given by the lower envelope of two tangents, 
we have 

¥>i(l + = ! + &£, &(z i ) = min{l + £,l + &e-c(e-0}- (8) 
(See Figure 2 for an illustration.) 

Since £ < e, we can assume that c = 0. Next, since we seek to find b and £ that maximize 

^(1 + min{l+£,l + 6e} . / 1+g 1 + be ) 



we can assume £ is such that = j^f, which yields £ = e6. Substituting, we now seek 
to maximize tt-t?, and we find that the maximum is achieved at b = - — \== and equals 



. Therefore, is an approximation guarantee for our approach. 

Finally, we show that this guarantee is tight. First, let 4>i be the new function ifi with 
b and c taken to have the values that yield the guarantee above. If we were to use our 
approach to approximate (pi, the tangents at 1 and 1 + e would have slopes b and c, instead 
of the desired 1 and c, since 4>i lacks derivatives at 1 and 1 + e, and our approach uses the 
derivative from the right when the derivative does not exist. We can cause our approach 
to produce tangents at 1 and 1 + e with slopes 1 and c by taking a sufficiently small ( and 
letting 4>i have its first breakpoint at 1 + ( instead of 1. When ( — > 0, the approximation 
ratio achieved by our approach for fa converges to the guarantee above. □ 

To compare the tight approximation guarantee with that provided by Lemma 1, note 
that i±^EEI < 1 + 1 for e > 0. Moreover, since 1+ ^ i + T ->• 1 and ^ 1+ ^ TT ->• \ as e ->• 0, it 
follows that 1 + | is the lowest ratio of the form 1 + | that is guaranteed by our approach. 
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Equivalently, instead of an approximation guarantee of using 1 + [log 1+e j 1 ] 

pieces, we can obtain a guarantee of 1 + e using only 1 + |~log 1+4e+4e 2 j 1 ] pieces. Note 
that log 1+4e+4e 2 f = log(1+4£+4 ^) logf , and as e -> 0, we have log(1+4e+4£ 2) -> +oo and 
iog(i+4e+4e a ) ~~ ^ ^' Therefore, as e — >■ 0, the number of pieces behaves as ^ log j-. 

This bound on the number of pieces enables us to apply our approximation approach to 
practical concave cost problems. In Section 3, we will exploit the logarithmic dependence 
on j 1 of this bound to derive polynomial bounds on the number of pieces for problems with 
polyhedral feasible sets. 



2.1 A Lower Bound on the Number of Pieces 

Since the approximation guarantee of Theorem 1 is tight, 1 + [logi+4 e +4 e 2 j 1 ] is a lower 
bound on the number of pieces needed to guarantee a 1 + e approximation when using 
the approach of equations (2)-(4). In this section, we establish a lower bound on the 
number of pieces needed to guarantee a 1 + e approximation when using any piecewise- 
linear approximation approach. 

First, we show that by limiting ourselves to approaches that use piecewise-linear func- 
tions whose pieces are tangent to the graph of the original function, we increase the number 
of needed pieces by at most a factor of 3. 

Let fa : R+ —7- R + be a nondecreasing concave function, which we are interested in 
approximating on an interval [li,ui\ with < k < u%. Assume that fa(xi) > for all 
Xi > 0; if (f>i(xi) = for some x\ > 0, then fa must be zero everywhere on [0, +oo), and 
we have a trivial case. Also let : R + — > M.+ be a piecewise-linear function with Q pieces 
that approximates fa on [Zj,«j] to within a factor of 1 + e, that is < ^'^'j < 1 + e for 
Xi £ [k,Ui\. We are not imposing any other assumptions on ipt; in particular it need not be 
continuous, and its pieces need not be tangent to the graph of fa. 

Lemma 2. The function fa can be approximated on [Zj, Uj\ to within a factor of 1 + e by a 
piecewise-linear function tpi : M + — > M + that has at most 3Q pieces and whose every piece 
is tangent to the graph of fa . 

Proof. First, we translate the pieces of ipi that are strictly above fa down, and the pieces 
strictly below up, until they intersect fa. Let the modified function be tp'^, clearly tp^ still 
provides a 1 + e approximation for fa on [Zj,Uj]. 

For each piece of we proceed as follows. Let ff + s\xi be the line defining the piece, 
and [a p , bp] be the interval covered by the piece on the x-axis. Without loss of generality, 
assume that [a p ,b p ] C [Zj,Uj]. If the piece is tangent to fa, we take it as one of the pieces 
composing ifi, ensuring that ifi provides a 1 + e approximation for fa on [a p ,b p ]. 

If the piece is not tangent, it must intersect fa at either one or two points. If the piece 
intersects at two points £i and £2, then the points partition the interval [a p ,6 p ] into three 
subintervals: [a p ,£i], on which the piece is above fa; [£1,62]) on which the piece is below fa; 
and [£2,frp]> on which the piece is again above fa. If there is one intersection point, we can 
partition [a p ,6 p ] similarly, except that one or two of the subintervals would be empty. 

On the interval [a p ,£i], the line ff + sfxj is above fa and provides a 1 + e approximation 
for fa. We take the tangent to fa at £1 as one of the pieces composing <pi, ensuring that tpi 
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provides a 1 + e approximation for fa on [a p ,£i]. Similarly, we take the tangent to fa at £2 
as one of the pieces, ensuring a 1 + e approximation on [£2, b p ]. 

Next, note that on the interval [£i,£2]> the line ff + s?Xi is below fa and provides a 
1 + e approximation for fa. Therefore, on [£1,^2]; the scaled line (1 + e)ff + (1 + e)s?Xi is 
above fa and still provides a 1 + e approximation for fa. Since the original line is below 
and the scaled line above fa, there is an e* with < e* < e such that, on [£1,^2]; the line 
(1 + e*)ff + (1 + e*)s?Xi is above fa and intersects it at one or more points. If this line is 
tangent, we take it as one of the pieces that define </?j. If the line is not tangent, it must 
intersect fa at exactly one point and we take the tangent to fa at £' as one of the pieces. 
In either case, we have ensured that ipi provides a 1 + e approximation for fa on [£1,^2]- 

Since u2 =1 [a p ,6 p ] = the constructed function ipi provides a 1 + e approximation 

for fa on [k,Ui]. Since for each piece of ipi, we introduced at most 3 pieces, ipi has at most 
3Q pieces. □ 

Next, we establish a lower bound on the number of pieces needed to approximate the 
square root function to within a factor of 1 + e by a piecewise-linear function that has its 
every piece tangent to the graph of the original function. Let fa(xi) = yfxl, and let <pi be 
a piecewise-linear function that approximates fa to within a factor of 1 + e on [/j,itj] and 
whose every piece is tangent to the graph of fa. 

To write the lower bounds in this section in a more intuitive way, we define the function 
7 (e) = (l + 2e(2 + e) + 2(1 + e)y/e(2 + e)) 2 . As e ->• 0, 7(e) behaves as 1 + \/32e, with the 
other terms vanishing because they contain higher powers of e. In particular, 1 + V32e < 
7(e) < 1 + 16^e for < e < i. 

Lemma 3. The function ipi must contain at least |~log 7 ( e ) j-\ pieces. As e — > 0, this lower 
bound behaves as -7= logy 1 . 

Proof. Given a point £0 £ [h,Ui\, a tangent to fa at £0 guarantees a 1 + e approximation 
on an interval extending to the left and right of £o- Let us denote this interval by [£o(l + 
<5i)>£o(l + 62)]- The values of 8\ and 82 can be found by solving with respect to 8 the 
equation 

fa(£o) + W^o) = (1 + e)fa((l + <%) (10a) 

Vlo + ^0^= = (1 + <0V(l + <%> ( 10b ) 
2y?o 

£0 + *£o + \S% = (1 + e) 2 (l + «%. (10c) 

This is simply a quadratic equation with respect to 8, and solving it yields 81 = 2e(2 + e) — 
2(l + e)y/e(2 + e) and 8 2 = 2e(2 + e) + 2(1 + e) y / e(2 + e). Let £1 = £o(l + <5i), and note that 
[£o(l + 6i), £0(1 + ^2)] = [£1, T+lr^ 1 ]- Therefore, the tangent provides a 1 + e approximation 
on the interval 



\f 1+82 f~ 




[^i + sfi_ 





£1, (l + 2e(2 + e) + 2(1 + e) ^ e(2 + e)f & 



[£i,7(e)6]- (11) 



Since 7(e) does not depend on £1, the best way to obtain a 1 + e approximation on [/j, m] 
is to iteratively introduce tangents that provide approximations on intervals of the form 
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[k,j(e)li], [y(e)li,j 2 (e)li], [y 2 (e)k, j 3 (e)li], . . . , until the entire interval [k,Ui] is covered. It 
immediately follows that we need at least |~log 7 ( e ) j 1 ] pieces to approximate fa on [k,Ui\. 

This bound can also be written as |" log7 ( e ) log f 1 ] . As e — > 0, we have i og7 ( e ) — > +°o 
and i^-jp^fj — > 1, and therefore, the lower bound behaves as ^= log j 1 . □ 

Combining Lemmas 2 and 3, we immediately obtain a lower bound for any piecewise- 
linear approximation approach. Let ipi : R+ — > R+ be a piecewise-linear function that 
approximates fa(xi) = JlTi to within a factor of 1 + e on [/j,itj]. Note that ipi need not be 
continuous or have its pieces tangent to the graph of fa. 

Theorem 2. TTie function ipi must contain at least |~| l°g7(e) f 1 ] pieces. As e — > 0, this 
lower bound behaves as , 1 log j 1 . 

This lower bound is within a factor of 2+ i g^i^7+4e a ) °^ tne numDer °f pieces required by 
our approach. This implies that for fixed e, the number of pieces required by our approach 
is within a constant factor of the best possible. As e — > 0, the number of pieces needed by 
our approach converges to a factor of ^ 2 ^ e 8e = 0(-^) of the lower bound. An interesting 
open question is to find tighter upper and lower bounds on the number of pieces as e — > 0. 



2.2 Extensions 

Our approximation approach applies to a broader class of problems. In this section, we 
generalize our results to objective functions that are not monotone and feasible sets that 
are not contained in R" . Consider the problem 

Z* l2 = min{<£(x) : x G X}, (12) 

defined by a compact feasible set IC1™ and a separable concave function (f> : Y — > R + . 
The feasible set X need not be convex or connected, and the set Y can be any convex 
set in R n that contains X. Let <j){x) = Y17=i ^i( x i)^ an d assume that the functions <fo are 
nonnegative. 

Instead of Assumption 1, we impose the following assumption. Let proj^,. Y denote the 
projection of Y on Xj, and note that proj^ Y is the domain of fa. 

Assumption 2. Problem (12) has an optimal solution x* = (x\, . . . , x*), bounds 
with [aij,/3j] C proj a , i Y, and bounds k,Ui with < k < u\ such that x* G {aj,/3j} U ([ojj + 
k, ai + Ui] n [j3i - Ui,f3i - k]) for i € [n]. 

Next, we apply the approach of equations (2)-(4) to approximate problem (12) to within 
a factor of 1 + e. We approximate each concave function fa by a piecewise-linear function 
ipi. Assume that the interval [aij + Zj, + itj] n \j3i — Ui, fa — lj\ is nonempty; if this interval 
is empty, we have a trivial case. For convenience, we define a new pair of bounds 

l[ = m&x{li, j3i - Ui - ai}, u\ = minj^, fa - k - ai}. (13) 

Note that [a, + k, ai + Ui] D [fa - Ui,fa - k\ = [ai + l[ J a i + u'J = [fa - u' i: fa - /•]. Since 
fa is concave, there is a point ^* G [aj,/3j] such that fa is nondecreasing on [aj,£*] and 
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nonincreasing on [£*,/%]. We do not have to compute £* in order to approximate fa. 
Instead, we simply introduce tangents starting from on + 1[ and advancing to the right, and 
starting from fa — l\ and advancing to the left. 

More specifically, we introduce tangents starting from ai+l[ only if the slope at this point 
is nonnegative. We introduce tangents at ai+l[, aj+/-(l+4e+4e 2 ), . . . , aj+^(l+4e+4e 2 ) < ^ i , 
where Qi is largest integer such that on + /-(l + 4e + 4e 2 )^ i < a« + v! i and the slope at 
asi + 1^(1 + 4e + 4e 2 )^ 1 is nonnegative. 

Let Q = minjctj + u'^di + 1^(1 + 4e + 4e 2 ) < ^ 1+1 }. If fa has a nonnegative slope at Q, 
we introduce an additional tangent at Q. If the slope at Q is negative, we find the largest 
integer rj such that the slope at Qj + /-(l + 4e + 4e 2 )^ l (l + e) ri is nonnegative, and introduce 
an additional tangent at that point. Since the slope is nonnegative at aj + ^(l + 4e + 4e 2 ) < ^ i , 
we have n > 0, and since Q < (*i + ^(1 + 4e + 4e 2 ) Ql+1 < at + /-(l + 4e + 4e 2 ) Ql (l + e) 4 , 
we have r« < 3. Let the tangents introduced starting from aj + l\ have slopes s°, . . . , 
and y- intercepts /P, . . . , 

We introduce tangents starting from /3j — ^ only if the slope at this point is nonpositive. 
We proceed in the same way as with the tangents starting from ctj + l\, and let these 
tangents have slopes sf 1+2 , . . . , s ^+^+ 3 an d y-intercepts /f I+2 , . . . , f® i+Ri+3 . Also let 

Pi = Qi + Ri + 3. 

If «j and fa are the endpoints of proj^. Y, for Xj G (ai,fa), the function ^ is given by 
iPi( Xi ) = min{/f + s^x, : p = 0, P}, (14) 

while for x« G {a;,/?;}, we let ipi(xi) = fa(xi). 

If «j and /3j are in the interior of proj^ y, we introduce two more tangents at a.{ and 
fa, with slopes sf l+1 , sf I+2 and y-intercepts ff i+1 , fi' t+2 , and let tpi(xi) = min{/? + sfxj : 
p = 0, Pj + 2}. If one of c^, is in the interior and the other is an endpoint, we use the 
corresponding approach in each case. 

We now replace the objective function fax) in problem (12) with the new objective 
function ip(x) = Y^i=i^i( x i) ■• obtaining the piecewise-linear cost problem 

Z x * 5 = min^x) : x G X}. (15) 

The number of pieces used to approximate each concave function fa in each direction is 
at most 2 + [log 1+ 4 e+4e 2 j 1 ] , and therefore the total number of pieces used for each function 
is at most 4 + 2 |~log 1+4e+4e 2 j 1 ] . As e — > 0, this bound behaves as ^ log j 1 . 

It remains to show that problem (15) provides a 1 + e approximation for problem (12), 
which we do by employing Lemma 1 and Theorem 1. 

Lemma 4. Z{ 2 < Z\ h < (1 + e)Z{ 2 . 

Proof. Clearly, Z* 2 < Z* 5 . To prove the inequality's other side, let x* be an optimal solution 
to problem (12) that satisfies Assumption 2. We will show that fa(x*) < (1 + e)fa(x*) for 
i G [n]. If x* G {ai,fa} then ipi(x*) = fa(x*). If x* G" {cti,fa}, we must have x* G [aj + 
l\, ai+u'j] = [fa—u'^fa — l 1 ^. Since fa is concave, it is nondecreasing on [aj, x*], nonincreasing 
on [x*,fa], or both. Without loss of generality, assume that fa is nondecreasing on [aj,x*]. 

Due to the way we introduced tangents starting from oti + l\, it follows that x* G 
[ccj + Ci]- We divide this interval into two subintervals, [ccj + ^, + /•(! + 4e + 4e 2 ) < ^ i ] 
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and [an + + 4e + Ae 2 ) Q \Ci] ■ If < G [«i + ^ + + 4e + 4e 2 ) Ql ] , then since fa is 
nondecreasing on this interval, ipi(x*) < (1 + e)<^>j(x*) follows directly from Theorem 1. 

Ifx* G [ai + ^(l + 4e + 4e 2 ) Q %Ci], additional steps are needed, since fa is not necessarily 
nondecreasing on this interval. If fa has a nonnegative slope at Q, then we introduced a 
tangent at £j, and ipi(x*) < (l + e)fa(x*) again follows from Theorem 1. If the slope at d is 
negative, we introduced a tangent at q« + 1^(1 + 4e + 4e 2 )'^ i (l + e) ri . Since r$ is the largest 
integer such that the slope at on + + 4e + 4e 2 )^'(l + e) ri is nonnegative, and the slope 
at x* is also nonnegative, x* G [a, + Z-(l + 4e + 4e 2 ) Qi ,a; + ^(1 + 4e + 4e 2 ) Q *(l + e) ri+1 ]. 

We now distinguish two cases. Ifx* G [ai+^(l+4e+4e 2 ) Q % ai+/-(l+4e+4e 2 ) Ql (l+e) r *] , 
then since </>j is nondecreasing on this interval, ipi(x*) < (1 + e)fa(x*) follows by Theorem 
1. Ifx* G [an + + 4e + 4e 2 ) Ql (l + e) r % a* + + 4e + 4e 2 ) Ql (l + e) r ' +1 ] , note that the 
right endpoint of this interval is 1 + e times farther from ccj than the left endpoint. Since 
0i is nondecreasing from the left endpoint to x*, and we introduced a tangent at the left 
endpoint, ipi(x*) < (1 + e)^(x*) follows by Lemma 1. 

Taken together, the above cases imply that Z\ b < ip(x*) < (l + e)<p(x*) = {\+e)Z{ 2 . □ 

We conclude this section with two further extensions: 

1) We can use secants instead of tangents, in which case we require on the order of 
one function evaluation per piece, and do not need to evaluate the derivative. The 
secant approach may be preferable in computational applications where derivatives 
are difficult to compute. 

2) The results in this section can be adapted to apply to concave maximization problems. 

3 Polyhedral Feasible Sets 

In this section and Section 3.1, we obtain the main result of this paper by applying our 
approximation approach to concave cost problems with polyhedral feasible sets. We will 
employ the polyhedral structure of the feasible set to eliminate the quantities k and Uj from 
the bound on the number of pieces, and obtain a bound that is polynomial in the input 
size of the concave cost problem and linear in l/e. 

Let X = {x : Ax < 6, x > 0} be a nonempty rational polyhedron defined by a matrix 
A G Q mxn and a vector b G Q m . Let <fi : — > R + be a nondecreasing separable concave 
function, with 4>{x) = Y17=i 4>i{ x i) an d each function fa nonnegative. We consider the 
problem 

Z{ % = mimj>(x) : Ax <b,x> 0}. (16) 

Following standard practice, we define the size of rational numbers, vectors, and matrices 
as the number of bits needed to represent them [see e.g. KV02]. More specifically, for an 
integer r, let size(r) = 1 + [~log 2 (|r| + 1)] ; for a rational number r = ^ with r 2 > 0, and r\ 
and r 2 coprime integers, let size(r) = size(ri) + size(r2); and for a rational vector or matrix 
M G Q pxq with elements rriij, let size(M) = pq + Ya=i Sj=i size(my). 

We take the input size of problem (16) to be the input size of the feasible polyhedron, 
size(A)+size(6). Assume that each function fa is given by an oracle that returns the function 
value fa(xi) and derivative <^(xj) in time 0(1). When the concave functions are given in 
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other ways than through oracles, the input size of problem (16) is at least size(^4) + size(6), 
and therefore our bound applies in those cases as well. 

We will use the following classical result that bounds the size of a polyhedron's vertices 
in terms of the size of the constraint matrix and right-hand side vector that define the 
polyhedron [see e.g. KV02]. Let U(A,b) = 4(size(A) + size(6) + 2n 2 + 3n). 

Lemma 5. If x' = . . . , x' n ) is a vertex of X, then each of its components has size(x^) < 
U(A,b). 

To approximate problem (16), we replace each concave function fa with a piecewise- 
linear function tpi as described in equations (2)-(4). To obtain each function tpi, we take 

l > = 2 U ( A M- 1 - 1 ' * = 2U(Ab) ' 1 ~ (17) 

and Pi = [log 1+4e+4e 2 j 1 ] , and introduce Pj + 1 tangents to fa at Zj, Zj(l + 4e + 4e 2 ), . . . , k(l + 
4e + 4e 2 ) p< . The resulting piecewise-linear cost problem is 

Z\ % = min{V>(x) : Ax < b, x > 0}. (18) 

The number of pieces used to approximate each function fa is 



1 + 



H 

2U(A,b) _ U{A,b) 



2U(A,b) 



log 2 (l + 4e + 4e 2 ) 



(19) 



As e — > 0, this bound behaves as 4 ( log2 ' e ) e = 2(iog 2 'e)e - Therefore, the obtained bound is 
polynomial in the size of the input and linear in 1/e. The time needed to compute the 
piecewise-linear approximation is also polynomial in the size of the input and linear in 1 /e. 
Specifically, we can compute all the quantities Zj and Ui in 0(U(A,b)), and then compute 
the pieces composing each function ipi in O( log j^i+iT+ie 2 ) ) P er function, for a total running 

time of 0(U(A,b) + lo jSt'^) ) = 0(^). 

Next, we apply Theorem 1 to show that problem (18) approximates problem (16) to 
within a factor of 1 + e. 



Lemma 6. Z\ % < Z\ % < (1 + e)Z 
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Proof. It is clear that problem (16) satisfies the assumptions needed by Theorem 1, except 
for Assumption 1 and the requirement that A be a compact set. Next, we consider these 
two assumptions. 

Because A is a polyhedron in and 4> is concave and nonnegative, problem (16) has an 
optimal solution x* at a vertex of X [HH61]. Lemma 5 ensures that size(x*) < U(A, b) for 
ie[n], and hence x* G {0} U [ 2 u ( Afi)-i_ v 2 U( - A ^- 1 - l] . Therefore, problem (16) together 
with the bounds Zj and Ui, and the optimal solution x* satisfies Assumption 1. 

If the polyhedron A is bounded, then Theorem 1 applies, and the approximation prop- 
erty follows. If A is unbounded, we add the constraints Xj < 2 U( - A ' b ^~ 1 — 1 for i G [n] to 
problems (16) and (18), obtaining the modified problems 

Z* im = min{0(x) : Ax < b,0 < x < 2 U{ - A ^- 1 - l}, (16B) 
Z* SB = min{^(x) : Ax < b, < x < 2 U ^~ 1 - l}. (18B) 
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Denote the modified feasible polyhedron by X-q. Since C X and x* G Xb, it follows 
that = Zi e and x* is an optimal solution to problem (16B). Similarly, let y* be a 

vertex optimal solution to problem (18); since X-q C X and y* G Xb, we have Z^ 8B = Z* s . 

Since X-q is a bounded polyhedron, problem (16B), together with the bounds k and 
Ui, and the optimal solution x* satisfies the assumptions needed by Theorem 1. When we 
approximate problem (16B) using the approach of equations (2)-(4), we obtain problem 
(18B), and therefore Z^ eB < Z^ 8B < (1 + e)Z* 6B . The approximation property follows. □ 

Note that it is not necessary to add the constraints X{ < 2 u ( A ^~ l — 1 to problem (16) 
or (18) when computing the piecewise-linear approximation, as the modified problems are 
only used in the proof of Lemma 6. 

If the objective functions fa of problem (16) are already piecewise-linear, the resulting 
problem (18) is again a piecewise-linear concave cost problem, but with each objective 
function ipi having at most the number of pieces given by bound (19). Since this bound 
does not depend on the functions fa, and is polynomial in the input size of the feasible 
polyhedron X and linear in 1/e, our approach may be used to reduce the number of pieces 
for piecewise-linear concave cost problems with a large number of pieces. 

When considering a specific application, it is often possible to use the application's 
structure to derive values of k and U{ that yield a significantly better bound on the number of 
pieces than the general values of equation (17). We will illustrate this with two applications 
in Sections 5 and 6. 

3.1 Extensions 

Next, we generalize this result to polyhedra that are not contained in R™ and concave 
functions that are not monotone. Consider the problem 



defined by a rational polyhedron X = {x : Ax < b} with at least one vertex, and a 
separable concave function (j> : Y — > R + . Here Y = {x : Cx < d} can be any rational 
polyhedron that contains X and has at least one vertex. Let <j){x) = Y^i=i < t ) i{ x i)i anc ^ 
assume that the functions fa are nonnegative. We assume that the input size of this 
problem is size(^4) + size(fe), and that the functions fa are given by oracles that return the 
function value and derivative in time 0(1). 

Since, unlike problem (16), this problem does not include the constraints x > 0, we 
need the following variant of Lemma 5 [see e.g. KV02]. Let V(A, b) = 4(size( J 4) + size(fr)). 

Lemma 7. If x' = (x^, . . . , x' n ) is a vertex of X, then each of its components has size(x^) < 



We approximate this problem by applying the approach of Section 2.2 as follows. If 
projj.. Y is a closed interval [c^,/3£], we let [aj,/3j] = [a^,/^]; if proj x .Y is a half-line 
K.+oo) or (-oo,#], we let = [oj, 2 V ^)- 1 ] or [a u ft] = [-2 V ^~\ #] ; and 

if the projection is the entire real line, we let [on, fa] = [-2 y ( A ' b \ 2 v ( A ' b ">] . 

If proj^,. Y is a closed interval or a half-line, we take 




min{(f)(x) : Ax < b} 



(20) 



V(A,b). 



k 



1 



and ^ = 2W)- 1 + 2 V ^)- 1 -1, 



(21) 



2 V(A,b)+V(C,d)-l _ i 
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while if projj.. Y is the entire real line, we take k = 2 V( - A ^- 1 and u { = 3 ■ 2 V ^- 1 . We 
then apply the approach of Section 2.2 as described from Assumption 2 onward, obtaining 
the piecewise-linear cost problem 

Z 2 * 2 = min{V>(x) : Ax < b}. (22) 

The number of pieces used to approximate each function fa is at most 

4 + 2 [log 1+4e+4e2 |] < 4 + 2 [log 1+4e+4e2 (2^<^(^) (2^> fe ) + 2^))) J 

< 4 + 2[log 1+4e+4e2 (2 l/ ^ b ) +v '^)2 v '^' fe ) +v '(^))l (23) 
~2V(A,b) +2V(C, d)~ 



4 + 2 



log 2 (l+4e + 4e 2 ) 



As e — >■ 0, this bound behaves as v ( A ^)+V(C,d) ^ Q ^ e th a t i n addition to the size of the 

' (log 2 e)t ' 

input and 1/e, this bound also depends on the size of C and d. Moreover, to analyze the 
time needed to compute the piecewise-linear approximation, we have to specify a way to 
compute the quantities ol i and fy. We will return to these issues shortly. 

Next, we prove that problem (22) approximates problem (20) to within a factor of 1 + e, 
by applying Lemma 4. 



Theorem 3. Z| < Z* 22 < (1 + e)Z. 



20- 



Proof. The assumptions needed by Lemma 4 are satisfied, except for Assumption 2 and the 
requirement that A be a compact set. We address these two assumptions as follows. 

First, note that problem (20) has an optimal solution at a vertex x* of A, since A is a 
polyhedron with at least one vertex, and (f> is concave and nonnegative [HH61]. By Lemma 
7, we have size(x*) < V(A,b) for i G [n], and hence x* G [-2 vi * A ^~ l + l^*^' 6 ) -1 - l] . 
We add the constraints -2 V{ - A ^~ 1 + 1 < x { < 2 V{ - A ^~ 1 — 1 for i G [n] to A, obtaining the 
polyhedron Ab and the problems 

Z 2 * 0B = min{0(x) : Ax < b, -2 v(A ' b ^ 1 + 1 < x < 2 V ^~ 1 - l}, (20B) 
Z* 2B = min{V>(x) : Ax < b, -2 V ^~ 1 + 1 < x < 2 V ^~ 1 - l}. (22B) 

It is easy to see that Z 20B = Z 20 and Z 22B = Z 22 , and that x* is an optimal solution to 
problem (20B). 

Clearly, A B is a compact set. To see that Assumption 2 is satisfied for problem (20B), 
consider the following three cases: 

1) If proj x .y = (-oo,+oo), then on = -2 v ^ b \ k = 2 V ^~\ and Ui = 3 • 2 v ^-\ 
As a result, x* - on G [-2 v ^ b )~ l + 1 + 2 y ( A > fe ), 2 v ^~ l - 1 + 2 y ( A ' 6 )] C {0} U [k,u^ 

2) If projx.y = (-oo,$], then a { = -2 V( - A ^~ 1 , and thus x* - a { > 1. On the other 
hand, = j3' { , implying that fa is a component of a vertex of Y, and thus size(/3j) < 
y(C,(i). Now, x* < pi implies that x* - at < 2 V( - C ^' 1 - 1 + 2 V( - A ^~ 1 . Since 
«i = 2 v(A, fc ) + v(c.d)-i-i and u i = 2 V ^~ 1 + 2 V ( C ^- 1 -1, we have x* -a, G {0}U[/^]. 
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3) If proj^,. Y = [a'i, +00) or proj a , i Y = [a' i: ft], then let x* = ^ with q\ > 0, and p\ and 
qi coprime integers. Similarly let a.i = and note that x* — «j = Pl92 q ~q* qi ■ Since 
aj = a^, we know that a« is a component of a vertex of Y, and hence size(aj) < V(C, d) 
and size((/2) < I^(C, <i). On the other hand, size(x*) < V(A, b), and thus size(gi) < 
V(A, b). This implies that size(gi(/2) < V(A, b) + V(C, d), and therefore either x* = oii 
or x* - oii> 2 v (Aib)+ v (c ,d)-i_ 1 - Next, since size(«i) < V(C,d) and size(x*) < V(A, b), 
we have x* - m < 2 v ^~ l + 2 V(C ^~ 1 - 2. Given that k 
u . = 2 V{A,b)-i + 2 V(C,d)-i _ ! it foUowg that x* - oh G {0} U 

Combining the three cases, we obtain x* G {ai}U[ai+/j, Oj+tij]. Similarly, we can show that 
x* G {ft }U [ft -u*, ft -Zj], and therefore x* G ({aju [ctj+Zj, a;+Ui])n({ft}U[ft-u;, ft-/;]), 
which is a subset of {q«, ft} U ([a* + k, ctj + Uj] n [ft - Uj, ft — Zj]) . 

Therefore, problem (20B), together with the quantities q« and ft, the bounds k and 
tii, and the optimal solution x* satisfies Assumption 2, and Lemma 4 applies. Using the 
approach of Section 2.2 to approximate problem (20B) yields problem (22B), which implies 
that ^Iob — ^22B — (1 + e )-^20B> an d the approximation property follows. □ 

To obtain a bound on the number of pieces that is polynomial in the size of the input and 
linear in 1/e, we can simply restrict the domain Y of the objective function to the feasible 
polyhedron X, that is let Y := X. In this case, bound (23) becomes 4 + 2 |~ log n+i^i^T 1 > 

and can be further improved to 4 + 2 |~ log (i+4 £ '+4 e 2) 1 > which behaves as l '^g^e)} as e — > 0. 

When Y = X, the time needed to compute the piecewise- linear approximation is also 
polynomial in the size of the input and linear in 1/e. The quantities a[ and ft can be 
computed by solving the linear programs minjxj : Ax < b} and maxjxj : Ax < b}. 
Recall that this can be done in polynomial time, for example by the ellipsoid method 
[see e.g. GLS93, KV02], and denote the time needed to solve such a linear program by 
7lp(^4, b). After computing the quantities ol i and ft, we can compute all the quantities 
a, and ft in 0(V(A, &)), all the bounds k and Ui in 0(V(A, &)), and the pieces composing 
each function (pi in O( log (^+4^4.4^) ) P er function. The total running time is therefore 

0(nT LP (A,b) + V(A,b) r^ffff^ ) = 0(nT LP (A,b) + ^). 

In many applications, the domain Y of the objective function has a very simple structure 
and the quantities a' { and ft are included in the input, as part of the description of the 
objective function. In this case, using bound (23) directly may yield significant advantages 
over the approach that lets Y := X and solves In linear programs. Bound (23) can be 
improved to M \ 2 ^^^^\ , and as e -+ it behaves as n^H^W^ 
Since a[ and ft are part of the input, the improved bound is again polynomial in the size 
of the input and linear in 1/e. 



4 Algorithms for Concave Cost Problems 

Although concave cost problem (16) can be approximated efficiently by piecewise-linear cost 
problem (18), both the original and the resulting problems contain the set cover problem 
as a special case, and therefore are NP-hard. Moreover, the set cover problem does not 
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have an approximation algorithm with a certain logarithmic factor, unless P = NP [RS97]. 
Therefore, assuming that P 7^ NP, we cannot develop a polynomial-time exact algorithm or 
constant factor approximation algorithm for problem (18) in the general case, and then use 
it to approximately solve problem (16). In this section, we show how to use our piecewise- 
linear approximation approach to obtain new algorithms for concave cost problems. 

We begin by writing problem (18) as an integer program. Several classical methods for 
representing a piecewise-linear function as part of an integer program introduce a binary 
variable for each piece and add one or more coupling constraints to ensure that any feasible 
solution uses at most one piece [see e.g. NW99, CGM03]. However, since the objective 
function of problem (18) is also concave, the coupling constraints are unnecessary, and we 
can employ the following fixed charge formulation. This formulation has been known since 
at least the 1960s [e.g. FLR66] . 

n Pi 

min EE + (24a) 

s.t. Ax < b, (24b) 

Pi 

X * = E^' i£H (24c) 

p=0 

0<yf<B iZ f, i€[n],p€{0,...,Pi}, (24d) 

2f€{0,l}, i€[n],p€{0,...,Pi}. (24e) 

Here, we assume without loss of generality that ipi(0) = 0. The coefficients Bi are chosen 
so that Xi < B{ at any vertex of the feasible polyhedron X of problem (18), for instance 
Bi = 2 U ( A ^- 1 - 1. 

A key advantage of formulation (24) is that, in many cases, it preserves the special 
structure of the original concave cost problem. For example, when (16) is the concave cost 
multicommodity flow problem, (24) becomes the fixed charge multicommodity flow problem, 
and when (16) is the concave cost facility location problem, (24) becomes the classical facil- 
ity location problem. In such cases, (24) is a well-studied discrete optimization problem and 
may have a polynomial-time exact algorithm, fully polynomial-time approximation scheme 
(FPTAS), polynomial-time approximation scheme (PTAS), approximation algorithm, or 
polynomial-time heuristic. 

Let 7 > 1. The next lemma follows directly from Lemma 6. 

Lemma 8. Let x' be a ^-approximate solution to problem (18), that is x' G X and Z\% < 
ip{x') < 7^i 8 . Then x' is also a (1 + e)j approximate solution to problem (16), that is 
Z* m <<t>(.x>)< (l + eh^Y 

Therefore, a 7-approximation algorithm for the resulting discrete optimization problem 
yields a (1 + 6)7 approximation algorithm for the original concave cost problem. More 
specifically, we compute a 1 + e piecewise-linear approximation of the concave cost problem; 
the time needed for the computation and the input size of the resulting problem are both 
bounded by 0( nU ^' b ^ . Then, we run the 7-approximation algorithm on the resulting 
problem. The following table summarizes the results for other types of algorithms. 
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When the resulting discrete 
optimization problem has a ... 

Polynomial-time exact algorithm 

FPTAS 

PTAS 

7-approximation algorithm 
Polynomial-time heuristic 



We can obtain for the original 
concave cost problem a ... 

FPTAS 
FPTAS 
PTAS 

(1 + 6)7 approximation algorithm 
Polynomial-time heuristic 



In conclusion, we note that the results in this section can be adapted to the more general 
problems (20) and (22). 



5 Concave Cost Multicommodity Flow 

To illustrate our approach on a practical problem, we consider the concave cost multicom- 
modity flow problem. Let (V, E) be an undirected network with node set V and edge set 
E, and let n = \ V\ and m = \E\. This network has K commodities flowing on it, with the 
supply or demand of commodity k at node i being b\. If b\ > then node i is a source 
for commodity k, while b\ < indicates a sink. We assume that each commodity has one 
source and one sink, that the supply and demand for each commodity are balanced, and 
that the network is connected. 

Each edge £ E has an associated nondecreasing concave cost function <pij : R + — > 

R+. Without loss of generality, we let <f>ij(0) = for {i,j} € E. For an edge {i,j} G E, let 
xfj indicate the flow of commodity k from i to j, and the flow in the opposite direction. 
The cost on edge is a function of the total flow of all commodities on it, namely 

(pij {^2k=i( x ij + x ji))- The goal is to route the flow of each commodity so as to satisfy all 
supply and demand constraints, while minimizing total cost. 

A mathematical programming formulation for this problem is given by: 

Z* 25 = min hi fe(4" + 4)) ' ( 25a ) 

{i,j}£E \k=l / 

s.t. Yl 4~ E x % = h ^ ieV,ke[K], (25b) 

{i,j}eE {j,i}eE 

>0, {i,j}€E,k€[K\. (25c) 

Let B k = J2i b k >o an( ^ B = Ylk=i B k - For simplicity, we assume that the coefficients b\ 
are integral. 

A survey on concave cost network flows and their applications is available in [GP90]. 
Concave cost multicommodity flow is also known as the buy-at-bulk network design problem 
[e.g CK05, CHKS06]. Concave cost multicommodity flow has the Steiner tree problem as a 
special case, and therefore is NP-hard, and does not have a polynomial-time approximation 
scheme, unless P = NP [BP89, ALM+98]. Moreover, concave cost multicommodity flow 
does not have an ©(log 1 / 2-6 nj approximation algorithm for e' arbitrarily close to 0, unless 
NP C ZTIME^p 01 ^'?) [And04]. 
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Problem (25) satisfies the assumptions needed by Lemma 6, since we can handle the 
cost functions 4>ij {J2k=i( x ij + x ji)) by introducing new variables £y = Ylk=i( x ij + x ji) f° r 
{i,j} £ E. We apply the approach of equations (17)— (19) to approximate this problem to 
within a factor of 1 + e, use formulation (24) to write the resulting problem as an integer 
program, and disaggregate the integer program, obtaining: 

K 

-^26 = mnl E fijp z ijp+ E ^ s ijp( x ijp + x jip)> (26a) 
{ijjfieE' {i,j, P }£E> k=l 

s.t. Yl 4p~ E x %P = b t ieV,ke[K], (26b) 

{i,j,p}£E> {j,i,p}GE> 

< x k jp , x) ip < B k y ijp , {i, j, p}eE',k£[K], (26c) 

y ijp e{0,l}, {i,j, P }eE'. (26d) 

This is the well-known fixed charge multicommodity flow problem, but on a new network 
(V, -E") with (P + l)m edges, for a suitably defined P. Each edge {i, j} in the old network 
corresponds to P + 1 parallel edges {i, in the new network, with p being an index to 
distinguish between parallel edges. For each edge {i,j,p} G E', the coefficient fij p can be 
interpreted as its installation cost, and Sij p as the unit cost of routing flow on the edge once 
installed. The binary variable yij p indicates whether edge {i, j,p} is installed. 

For a survey on fixed charge multicommodity flow, see [AMOR95, BMM97]. This 
problem is also known as the uncapacitated network design problem. The above hardness 
results from [BP89, ALM+98] and [And04] also apply to fixed charge multicommodity flow. 

By bound (19), P < |~ log T[+^jjjP) l > with A and b' being the constraint matrix and 
right-hand side vector of problem (25). However, we can obtain a much lower value of P 
by taking problem structure into account. Specifically, we perform the approximation with 
k = l and Ui = B, which results in P < \ log J°ll? +4e 2) \ ■ 

Lemma 9. Z% 5 < Z* 2% < (1 + e)Z 2 * 5 . 

Proof. Since the objective is concave and nonnegative, problem (25) has an optimal solution 
at a vertex z of its feasible polyhedron [HH61]. In z, the flow of each commodity occurs on 
a tree [see e.g. Sch03], and therefore, the total flow Ylk=i( z ij + z ji) on an y e dge {i, j} G E 
is in {0} U [1, B}. The approximation result follows from Theorem 1. □ 



5.1 Computational Results 

We present computational results for problems with complete uniform demand — there is a 
commodity for every ordered pair of nodes, and every commodity has a demand of 1. We 
have generated the instances based on [BMW89] as follows. To ensure feasibility, for each 
problem we first generated a random spanning tree. Then we added the desired number 
of edges between nodes selected uniformly at random. For each number of nodes, we 
considered a dense network with n{n — l)/4 edges (rounded down to the nearest multiple 
of 5), and a sparse network with 3ro edges. For each network thus generated, we have 
considered two cost structures. 
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TT 


n 


m 


K 


Flow 


Pieces 








Variables 


1 


10 


30 


90 


8,100 


41 


2 


20 


60 


380 


22,800 


77 


3 


20 


95 


380 


36,100 


77 


4 


30 


90 


870 


78,300 


98 


5 


30 


215 


870 


187,050 


98 


6 


40 


120 


1,560 


187,200 


113 


7 


40 


390 


1,560 


608,400 


113 


8 


50 


150 


2,450 


367,500 


124 


9 


50 


610 


2,450 


1,494,500 


124 


10 


60 


180 


3,540 


637,200 


133 


11 


60 


885 


3,540 


3,132,900 


133 


12 


70 


210 


4,830 


1,014,300 


141 


13 


70 


1,205 


4,830 


5,820,150 


141 


14 


80 


240 


6,320 


1,516,800 


148 


15 


80 


1,580 


6,320 


9,985,600 


148 



Table 1: Network sizes. The column "Pieces" indicates the number of pieces in each 
piecewise linear function resulting from the approximation. 

The first cost structure models moderate economies of scale. We assigned to each edge 
{i,j} £ E a cost function of the form <pij{£,ij) = a + b(^ij) c , with a, 6, and c randomly 
generated from uniform distributions over [0.1,10], [0.33,33.4], and [0.8,0.99]. For an av- 
erage cost function from this family, the marginal cost decreases by approximately 30% as 
the flow on an edge increases from 25 to 1,000. The second cost structure models strong 
economies of scale. The cost functions are as in the first case, except that c is sampled 
from a uniform distribution over [0.0099,0.99]. In this case, for an average cost function, 
the marginal cost decreases by approximately 84% as the flow on an edge increases from 
25 to 1,000. Note that on an undirected network with n nodes, there is an optimal solution 
with the flow on each edge in {0, 2, ... , n(n — 1)}. 

Table 1 specifies the problem sizes. Note that although the individual dimensions of the 
problems are moderate, the resulting number of variables is large, since a problem with n 
nodes and m edges yields n(n — l)m flow variables. The largest problems we solved have 
80 nodes, 1,580 edges, and 6,320 commodities. To approach them with an MIP solver, 
these problems would require 1,580 binary variables, 9,985,600 continuous variables and 
10,491,200 constraints, even if we replaced the concave functions by fixed charge costs. 

We chose e = 0.01 = 1% for the piecewise linear approximation. Here, we have been 
able to reduce the number of pieces significantly by using the tight approximation guarantee 
of Theorem 1 and the problem-specific bound of Lemma 9. After applying our piecewise 
linear approximation approach, we have reduced the number of pieces further by noting 
that for low argument values, our approach introduced tangents on a grid denser than the 
uniform grid 2,4, 6, . . . For each problem, we have reduced the number of pieces per cost 
function by approximately 47 by using the uniform grid for low argument values, and the 
grid generated by our approach elsewhere. 

We used an improved version of the dual ascent method described by Balakrishnan 
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Moderate economies of scale Strong economies of scale 





Time 


boi. 






Time 


boi. 








Edges 


£DA% 


£ALL% 


Edges 


£DA% 


£ALL% 


1 


0.13s 


14 


0.41 


1.41 


0.19s 


9 


0.35 


1.35 


2 


3.17s 


31 


1.45 


2.46 


3.99s 


19 


1.06 


2.07 


3 


3.50s 


25.3 


1.20 


2.21 


5.37s 


19 


3.38 


4.42 


4 


18.5s 


43.7 


1.94 


2.96 


11.7s 


29 


1.18 


2.20 


5 


31.3s 


44 


2.16 


3.19 


21.1s 


29 


3.50 


4.54 


6 


lm23s 


61.7 


2.47 


3.49 


27.1s 


39 


2.20 


3.22 


7 


2m6s 


59 


3.24 


4.28 


lmlls 


39 


3.17 


4.21 


8 


3m45s 


79 


2.22 


3.24 


lmlls 


49 


3.42 


4.46 


9 


6ml9s 


74.7 


3.10 


4.13 


2m48s 


49 


4.22 


5.26 


10 


8m45s 


95 


2.58 


3.61 


2mls 


59 


3.27 


4.30 


11 


18m52s 


95.7 


3.64 


4.68 


5m59s 


59 


4.25 


5.29 


12 


16m44s 


101.7 


2.85 


3.87 


2m35s 


69 


3.77 


4.81 


13 


39ml8s 


115.7 


4.19 


5.24 


9m43s 


69 


4.98 


6.03 


14 


32m46s 


127.7 


2.82 


3.84 


4m25s 


79 


4.10 


5.14 


15 


lh24m 


143 


5.24 


6.29 


15m50s 


79 


5.67 


6.73 


Average 




2.63 


3.66 






3.24 


4.27 



Table 2: Computational results. The values in column "Sol. Edges" represent the number 
of edges with positive flow in the obtained solutions. 

et al. [BMW89] (also known as the primal-dual method [see e.g. GW97]) to solve the 
resulting fixed charge multicommodity flow problems. The method produces a feasible 
solution, whose cost we denote by Z^-, to problem (26) and a lower bound on the 

optimal value of problem (26). As a result, for this solution, we obtain an optimality 

z DA 

gap eDA = — 1 with respect to the piecewise linear problem, and an optimality gap 

call = (1 + + ^da) — 1 with respect to the original problem. 

Table 2 summarizes the computational results. We performed all computations on an 
Intel Xeon 2.66 GHz. For each problem size and cost structure, we have averaged the 
optimality gap, computational time, and number of edges in the computed solution over 3 
randomly-generated instances. 

We obtained average optimality gaps of 3.66% for problems with moderate economies 
of scale, and 4.27% for problems with strong economies of scale. This difference in average 
optimality gap is consistent with computational experiments in the literature that analyze 
the difficulty of fixed charge problems as a function of the ratio of fixed costs to variable 
costs [BMW89, HS89]. Note that the solutions to problems with moderate economies of 
scale have more edges than those to problems with strong economies of scale; in fact, in 
the latter case, the edges always form a tree. 

To the best of our knowledge, the literature does not contain exact or approximate 
computational results for concave cost multicommodity flow problems of this size. Bell and 
Lamar [BL97] introduce an exact branch-and-bound approach for single- commodity flows, 
and perform computational experiments on networks with up to 20 nodes and 96 edges. 
Fontes et al. [FHC03] propose a heuristic approach based on local search for single-source 
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single-commodity flows, and present computational results on networks with up to 50 nodes 
and 200 edges. They obtain average optimality gaps of at most 13.81%, and conjecture that 
the actual gap between the obtained solutions and the optimal ones is much smaller. Also 
for single-source single-commodity flows, Fontes and Goncalves [FG07] propose a heuristic 
approach that combines local search with a genetic algorithm, and present computational 
results on networks with up to 50 nodes and 200 edges. They obtain optimal solutions for 
problems with 19 nodes or less, but do not provide optimality gaps for larger problems. 



6 Concave Cost Facility Location 

In the concave cost facility location problem, there are m customers and n facilities. Each 
customer i has a demand of di > 0, and needs to be connected to a facility to satisfy it. 
Connecting customer i to facility j incurs a connection cost of Cijdi, the connection costs 
Cij are nonnegative and satisfy the metric inequality. 

Let Xij = 1 if customer i is connected to facility j, and = otherwise. Then the total 
demand satisfied by facility j is Y^ILi d% x ij- Each facility j has an associated nondecreasing 
concave cost function <j>j : R + — > R + . We assume without loss of generality that 4>j (0) = 
for j € [n\. At each facility j we incur a cost of <pj (Y^Li diCij)- The goal is to assign each 
customer to a facility, while minimizing the total connection and facility cost. 

The concave cost facility location problem can be written as a mathematical program: 

n / m \ n rn 

Z* 21 = min ^ 4>j I ^2 diXi i ) + c v d i x v^ ( 27a ) 

j=i \i=i J j=i i=i 

n 

s.t. ^2 Xij = 1, i e [n], (27b) 

3=1 

Xij>0, ie[m],j e[n]. (27c) 

Let D = di. We assume that the coefficients and di are integral. 

Concave cost facility location has been studied since at least the 1960s [KH63, FLR66]. 
Mahdian and Pal [MP03] developed a 3 + e' approximation algorithm for this problem, for 
any e' > 0. When the problem has unit demands, that is d\ = ■ ■ ■ = d m = 1, a wider 
variety of results become available. In particular, Hajiaghayi et al. [HMM03] obtained a 
1.861-approximation algorithm. Hajiaghayi et al. [HMM03] and Mahdian et al. [MYZ06] 
described a 1.52-approximation algorithm. 

Concerning hardness results, concave cost facility location contains the classical facility 
location problem as a special case, and therefore does not have a polynomial-time approxi- 
mation scheme, unless P = NP, and does not have a 1.463-approximation algorithm, unless 
NP C DTIME(n°( lo g lo s n )) [GK99]. 

As before, problem (27) satisfies the assumptions needed by Lemma 6. We apply the 
approach of equations (17)-(19) to approximate it to within a factor of 1+e, use formulation 
(24) to write the resulting problem as an integer program, and disaggregate the integer 
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program: 



n P n P m 




mm 



£ £ fjyj + £ £ £K + 



(28a) 



j=l p=0 j=l p=0 i=l 



n P 



s .t. ££4 = 1, 



i G [mj, 



(28b) 



o<4-<i£, 

^€{0,1}, 



i£ [m],j G [n],p€{0,...,P}, 
J£ [n],p€{0,...,P}. 



(28c) 
(28d) 



We have obtained a classical facility location problem that has m customers and Pn fa- 
cilities, with each facility j in the old problem corresponding to P facilities (j,p) in the 
new problem. Each coefficient /J can be viewed as the cost of opening facility (j,p), while 
s p - + Cij can be viewed as the unit cost of connecting customer i to facility (j,p). Note 
that the new connection costs + Cij satisfy the metric inequality The binary variable y? 
indicates whether facility (j,p) is open. 

Problem (28) is one of the fundamental problems in operations research [CNW90, 
NW99]. Hochbaum [Hoc82] showed that the greedy algorithm is a O(logn) approximation 
algorithm for it, even when the connection costs c^- are non-metric. Shmoys et al. [STA97] 
gave the first constant-factor approximation algorithm for this problem, with a factor of 
3.16. More recently, Mahdian et al. [MYZ06] developed a 1.52-approximation algorithm, 
and Byrka [Byr07] obtained a 1.4991-approximation algorithm. The above hardness results 
of Guha and Khuller [GK99] also apply to this problem. 

Bound (19) yields P < |~ i og 7i+S+4e^) l ' w ^ ^ anc ^ ^ being the constraint matrix and 
right-hand side vector of problem (27). We can obtain a lower value for P by taking k = 1 
and ui = D, which yields P < f log 2 (l+^e-h^^) 1 " P ro °f °f the following lemma is similar 
to that of Lemma 9. 

Lemma 10. Z| 7 < Z| 8 < (1 + e)Z| 7 . 

Combining our piecewise-linear approximation approach with the 1.4991-approximation 
algorithm of Byrka [Byr07], we obtain the following result. 

Theorem 4. There exists a 1.4991+e' approximation algorithm for the concave cost facility 
location problem, for any e' > 0. 

We can similarly combine our approach with other approximation algorithms for classi- 
cal facility location. For example, by combining it with the 1.52-approximation algorithm 
of Mahdian et al. [MYZ06], we obtain a 1.52 + e' approximation algorithm for concave cost 
facility location. 
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